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PEPTIDE SEQUENCE TAGS AND METHOD OF USING SAME 

This invention relates to peptide sequence tags used to elicit antibodies, which 
can be used to detect defined families of proteins. 

Natural populations of phytoplankton include representatives of numerous 
species of cyanobacteria, diatoms, green algae and other groups. Nevertheless many of 
these species share core biochemical pathways supporting primary productivity and 
elemental cycling (Bryant, 1994; Falkowski & Raven, 1997). To assess the gross capacity 
for key metabolic transformations in, aquatic habitats, and to track acclimatory changes 
in these capacities, researchers require reagents to quantitatively detect all members of a 
functional class of enzymes (Bouchard et al., 2002; Schof ield et al., 2002), for example the 
RbcL (RUBISCO) enzyme responsible for carbon fixation in all photosynthetic 
organisms. Different members of the organism population will contain somewhat 
different versions of the RbcL enzyme, which nevertheless share similar core properties 
and shared conserved regions (ncbi.nlm.nih.gov). Conventional immunological 
detection uses an antibody raised against one particular protein from one species, which 
will then bind with variable affinity to other related proteins depending on their 
antigenic similarity to the initial target molecule (Orellana & Perry, 1992). This is 
problematic because change in immunological signals could result from: a) changes in 
the level of the targets b) changes in the population composition resulting in shifts in the 
specific mix of target molecules present or c) a combination of (a) and (b). 

Conventional antibodies are raised against two classes of protein targets; namely 
(a) purified of over-expressed protein from a particular species, and (b) a peptide 
selected to match the sequence of a region of a particular protein. Such antibodies are 
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generally raised against proteins from a model species, and show variable cross- 
reactivity to related proteins from other species. It is not practical to develop individual 
antibodies to detect each protein of interest from each strain in a population, many of 
which are poorly characterized or unknown, and even unculturable (Staley & 
Reysenbach 2002). Most of the protein families of interest in cyanobacteria and 
phy toplankton are also highly conserved in plants so that the same detection system can 
meet needs for standard antibodies in plant sciences. 

Thus, a need exists for a set of peptide targets to elicit production of a set of 
antibodies to detect key proteins involved, inter alia, in photosynthetic activity. A 
system should be able to evenly and universally recognize all members of a defined 
enzyme family or subfamily based on shared characteristics. 

The above defined need is met by the present invention which provides a 
method for detecting the presence of members of the target protein family in a sample 
comprising the steps of: 

(a) identifying and obtaining a peptide sequence tag conserved for all 
sequences of members of the target protein family, and exclusive to the target protein 
family; 

(b) assessing the tag for immunogenicity potential 

(c) utilizing the tag to elicit the production of antibodies; and 

(d) using the antibodies to measure the concentration of members of the 
target protein family in a sample. 
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The invention also provides a peptide sequence tag selected from the group 
consisting of 





cr;o td MO- Q 


bby ID I\UJ. /. 




SEQIDNO: 3 


SEQIDNO: 11 


SEQIDNO: 4 


SEQIDNO: 12 


SEQIDNO: 5 


SEQ ID NO: 13 


SEQIDNO: 6 


SEQIDNO: 14 


SEQIDNO: 7 


SEQ ID NO: 15 


SEQIDNO: 8 


SEQ ID NO: 16 



In accordance with another aspect of the invention, the invention provides a 
method for detecting the presence of a target protein in a sample comprising the steps 
of: , 

(a) identifying and obtaining a peptide sequence tag conserved for all 
members of and exclusive to a protein family; 

(b) assessing the tag for immunogenicity; and 

(c) synthesizing the tag provided it possesses a predetermined level of 
immunogenicity. 

(d) utilizing the tag to elicit the production antibodies; and 

(e) using the antibodies to provide an indication of protein concentration in a 
sample. 

According to yet another aspect, the invention provides a method of using a 
peptide sequence tag for coupling to column matrice materials for affinity purification 
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of the global antibodies produced according to the invention. 

According to yet another aspect, there is provided a method of developing 
characterized concentration standards for quantitation of the concentration of target 
proteins in samples by comparison to the concentration standards comprising the steps 
of: 

(a) coupling a defined molar quantity of protein carrier molecule to a defined molar 
quantity of peptide sequence tag selected from the group consisting of 

SEQ ID NO: 1 SEQIDNO:9 
SEQ ID NO: 2 SEQ ID NO: 10 

SEQ ID NO: 3 SEQ ID NO: 11 

SEQ ID NO: 4 SEQ ID NO: 12 

SEQ ID NO: 5 SEQ ID NO: 13. 

SEQ ID NO: 6 SEQ ID NO: 14 

SEQ ID NO: 7 SEQ ID NO: 15 

SEQ ID NO: 8 SEQ ID NO: 16 

(b) subjecting a known molar quantity of the coupled complex from (a) to 
electrophoretic separation in parallel SDS-PAGE gel lanes with protein extracts 
containing members of the target protein family, followed by electrophoretic transfer to 
a membrane 

(c) immunodetection using a global antibody produced according to the above- 
defined method of the coupled standard and any members of the target protein family 
identified by the peptide sequence tag and 

(d) using an immunological signal from the known molar quantity of coupled 
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complex as a standard for measuring the molar quantity of members of the target 
protein family present in the protein extracts. 

In a still further aspect, there is provided a method for quantitation of members 
of the target protein families in multiple samples using Enzyme-Linked Immunosorbent 
Assay kits using on characterized global antibodies produced according to the above- 
defined method and quantitation standards produced according to the method defined 
in the preceding paragraph. 

In another aspect, there is provided a method for eliciting production of 
monoclonal, transgenic or synthetic antibodies using peptide sequence tags produced 
according to the above defined method and standard immunological protocols. 

In a further aspect, there is provided a method for affinity screening of libraries 
of reagents to detect specific reagent binding to a peptide sequence tag produced 
according to the above defined method and immobilized on a support matrice; and 

testing reagents binding to the immobilized peptide sequence tag for affinity 
binding to members of the target protein families. 

In a still further embodiment, there is provided a method to use affinity binding 
using global antibodies produced according to the above-defined method to capture 
target proteins from complex mixtures for subsequent analyses of the specific sequences 
of target proteins present in the mixture using mass spectroscopy. 

In yet another embodiment, there is provided a method to use affinity binding 
using global antibodies produced according to the above defined method to capture and 
remove target proteins from complex mixtures to lower interference with detection and 
analyses of other less abundant protein classes in proteomics applications such as two 
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dimensional isoelectric focusing /SDS-PAGE and subsequent mass spectroscopic 
analyses of protein sequences. 

In generalterms, the inventors have designed a set of peptide targets or peptide 
sequence tags which elicit production of a set of antibodies for detecting key proteins 
involved in photosynthetic productivity. The inventors chose target protein families 
based on scientific interest and wide applicability. They have found and aligned 
sequences from public databases to detect peptide sequence tags of 6-25 amino acids 
which are conserved in all known members of the target protein family, and use 
bioinf ormatic analyses to determine if the conserved peptides are unique to the target 
protein family. The peptide sequence tags are assessed for potential immuriogenicity, 
feasibility of synthesis, solubility and stability; avoiding sequences that are targets for 
known or putative post-translational modification in proteins. Selected peptide 
sequence tags are synthesized, coupled to a carrier and used to elicit antibody 
production. The specificity and titre of the antibodies were then listed. A set of 
antibodies increases the utility of the system by allowing comprehensive detection of 
key molecules in a sample, population or community. The target protein families were 
selected based on their core roles in the biosphere and their interest and importance for 
environmental research, modelling, and monitoring. 

Public sequence databases were scanned for all published sequences of proteins 
in a given family (http://www.expasy.ch; http:/ / www.ncbi.nlm.nih.gov) The 
sequences of all published members of each target protein family were aligned (Corpet, 
1988). Peptide sequence tags of 6-25 amino acids were selected whose sequences are 
conserved in all known members of the target protein or sub-family. The peptide 
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sequence tags were tested to determine exclusivity to the target protein family using 
short-peptide BLAST searches of sequence databases (Altschul et al., 2001). The position 
of each potential peptide sequence tag in a given protein family was analyzed to ensure 
it is maintained in the mature proteins, and to avoid regions of the proteins known or 
suspected to undergo post-translational modifications such as glycosylation that could 
interfere with later antibody recognition of the mature proteins. The peptide sequence 
tags are screened for antigenic potential using peptide property prediction algorithms, 
and to assess their feasibility for synthesis, solubility and stability based on amino acid 
composition. In summary the chosen peptide sequence tags are conserved in all 
published members of the defined target protein family or subfamily, do not align 
significantly with members of other known protein families, and have acceptable 
predicted antigenic and synthesis properties. 

The selected peptide sequence tags are synthesized. The peptide sequence tags 
are then coupled to appropriate immunogenic carrier molecules, typically Keyhole 
Limpet Hemocy anin, and the complexes are used to elicit production of IgY antibodies 
in hens. The IgY fraction is separated from the eggs of the immunized hens and the 
fraction is screened using Enzyme-Linked Immunosorbent Assays (EUSA) for binding 
to the original peptide target. Each IgY production ran generates sufficient antibody for 
hundreds of thousands of immunodetections. Additional hens can be immunized to 
generate further comparable antibody preparations and for pooling of antibody 
preparations from multiple hens. 

Members of the target protein family are extracted from a range of species, 
separated by SDS-polyacrylamide gel electrophoresis, electrophoretically transferred to 
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membranes and immunoblotting is used to characterize the binding of the antibodies to 
a range of members of the target protein family. Antibody preparations with good 
target affinity but which show non-specific binding to other proteins are subjected to 
affinity purification followed by re-characterization to attempt to improve specificity. 

The novel antibodies can be applied to detect major proteins in a range of species, 
including uncharacterized species, with confidence that the detection affinity of the 
antibody is standard for all denatured members of the target protein family. Therefore a 
quantity standard protein from one species or a synthetic quantity standard can be used 
for comparative quantitation of members of the protein family from other species. 
DETAILED DESCRIPTION OF INVENTION 

Peptide sequence tags designed for eliciting production of global antibodies 
binding all members of defined protein families or subfamilies. 

In the following, all peptides are written according to convention from amino 
terminus to carboxy terminus using the standard single letter amino acid code. All 
peptides are based on alignments of protein sequences accessed through the NCBI 
(ncbi.nlm.nih.gov) and SwissProt (expasy.ch) public databases. Where present a lower- 
case "c" indicates a terminal cysteine not present in the original protein family but 
added for chemical coupling to the immunogenic carrier molecule, usually Keyhole 
Limpet Hemocyanin. Ait upper case terminal "C" represents a cysteine present in the 
original protein, but also used for chemical coupling to the immunogenic carrier 
molecule. 

1. PsbA: EVMHERNAHN FPLDc (SEQ ID NO:l) Photosystem II is the 
ultimate source of almost all biosynthetic reductant in the biosphere. The PsbA (Dl) 
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protein of Photosystem II is rapidly cycled under illumination in all oxygenic 
photobionts (Aro et al., 1993). Disruptions of PsbA cycling or losses of PsbA pools are 
central to loss of Photosystem II function and consequent photoinhibition of 
photosynthesis in cyanobacteria, algae and plants tinder a wide range of conditions 
including excess light, low temperature and UV exposure (e.g. Bouchard et al., 2002; 
Campbell et al., 1998). Tracking PsbA pools using the global PsbA antibody elicited by 
the PsbA peptide sequence tag can show the functional content of Photosystem II in a 
wide range of samples. 

This PsbA peptide sequence tag is absolutely conserved in the PsbA proteins 
from almost all known oxygenic phptoautotrophs, with only minor variants found in 
some liverworts. The global antibody raised against this PsbA peptide sequence tag has 
to date been demonstrated to specifically recognize the PsbA protein from a wide range 
of species including plants, red algae, cyanobacteria, green algal lichens and a mixed 
natural phytoplankton community. For example the antibody is being applied to a 
biological oceanography project to study UV acclimation in natural phytoplankton at 
sites from the Arctic to the Antarctic (Bouchard et al., 2002), and also to a study of 
seasonal acclimation in lichens (Schofield et al., 2002) 

2. RbcL: CLRGGLDFTK DDENINS (SEQ ID NO:2) RbcL (RUBISCO) is the 
catalytic subunit of the primary carbon dioxide fixation enzyme in the biosphere and is 
present in all photobionts, along with many other prokary otic organisms that fix carbon 
through chemoautotropic mechanisms. The kinetic properties of RbcL are well 
characterized and the activity of RbcL limits total carbon dioxide uptake by many 
communities (e.g. Badger & Andrews, 1987; von Cammerer & Quick, 2001). The enzyme 
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has a low turnover rate (low kcat) but because the total flux of carbon fixation through 
the enzyme is large in photosynthetic organisms, the enzyme accumulates to high 
concentrations (e.g. 5-10% of extractable protein in cyanobacteria). It is thus a major sink 
for nitrogen and protein resources in photosynthetic organisms, and is indeed the most 
abundant protein on earth and a major protein source in the human diet, either directly 
through consumption of green plants or through contributions to forage feed for 
animals. Quantitating RbcL thus shows the total capacity for carbon uptake in a sample 
or community. This RbcL peptide sequence tag is diagnostic of the Type I sub-class 
of RUBISCO found in almost all oxygenic photoautotrophic organisms with the 
exception of dinoflagellates and the marine prochlorophyte Prochlorococcus. This RbcL 
peptide sequence tag is absolutely conserved in all known sequences from 
cyanobacteria, green algae, liverworts, mosses, conifers, eudicots, and monocots. The 
RbcL peptide sequence tag is conserved perfectly in some species, but shows minor 
variants in some species of ferns, euglenoids, gamma-proteobacteria, beta- 
proteobacteria, alpha-proteobacteria. It is present but imperfectly conserved in red 
algae, diatoms, cryptomonads, haptophytes and brown algae. The global antibody 
raised against this RbcL peptide sequence tag has to date been demonstrated to 
specifically recognize the RbcL protein from a wide range of species including 
cyanobacteria, green algal lichens, various plants and a mixed phytoplankton 
community dominated by diatoms. 

IN THE DRAWING 
The accompanying drawing shows the results of an immunoblot 
chemiltiminescent detection of RbcL protein in total protein extracts from (a) an elm 
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tree, (b) cyanobacterium (Synechococcus sp. PCC 7942), (c) marsh grass (Spartina) and (d) 
mixed population of marine phytoplankton from the Gulf of St. Lawrence, dominated 
by diatoms. 

Total denatured protein extracts from the four samples were separated by SDS- 
PAGE and electrophoretically transferred to hydrophobic membrane. The membrane 
was washed with a 1:4000 dilution of the global RbcL IgY antibody fraction (non-affinity 
purified) using standard immunoblotting procedures and solutions (Ausubel et al., 
1997). The Global RbcL antibody was then detected using a commercial secondary goat 
anti-chicken IgY antibody conjugated to a horse radish peroxidase enzyme label. 
Finally, the areas with bound horse radish peroxidase were detected using ECL+ 
(Amersham Pharmacia) chemiluminescent 

The drawing illustrates the broad detection range and examples of the three 
main utilities of the new global antibodies raised against peptide sequence tags; namely 
(a) detection of a major protein from organisms (elm and Spartina) which are relatively 
uncharacterized at the molecular level but which are of ecological interest, (b) detection 
of the same protein from a widely studied model species, the cyanobacterium 
Synechococcus, and (c) detection of the same protein family from a mixed phytoplankton 
community. 

Application (c) is part of a study of natural phytoplankton responses to changing 
UVB (Bouchard et al., 2002), where both the absolute level of the target protein and the 
community structure change under UVB exposure, necessitating an antibody with even 
detection efficiencies for all members of the target protein family. 
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3. GlnA: cTNSYKRLVP G (SEQ ID NO:3) GlnA or glutamine synthetase is 
the primary point for assimilation of inorganic ammonia nitrogen into the biosphere. 
During nitrogen assimilation all nitrogen sources are converted to ammonia, no matter 
what the original source, and then assimilated predominately via the activity of 
glutamine synthetase. Thus tracking levels of glutamine synthetase shows the metabolic 
capacity of a sample or community for total nitrogen assimilation . 

This GlnA peptide sequence tag shows perfect to high conservation in alpha, beta 
and gamma proteobacteria, enterobacteria, most cyanobacteria, thermotogales, low GC 
gram+, euryarchaeotes and crenarchaeotes. It shows moderate conservation with: 
aquificales, high GC gram+ (Streptomyces) and Trichodesmium thiebautii (a marine 
cyanobacteria). 

The GlnA peptide sequence tag shows weak and sporadic conservation with 
glutamine synthetase Type III (GlnN) and with some glutaminyl-tRNA synthetases 
(glutamine-tRN A ligase) (GLNRS), but antibodies raised against this peptide sequence 
tag are not expected to detect these enzymes. This peptide sequence tag shows no 
conservation with any eukaryotic GlnA, and therefore does not react with glutamine 
synthetases from eukaryotic sources. The global antibody raised against this GlnA 
peptide sequence tag has to date been demonstrated to specifically recognize the GlnA 
protein from several species of cyanobacteria. 

4. NifH: VESGGPEPGV GC (SEQ ID NO: 4) The NifH subunit is a 
component of the unstable nitrogenase enzyme system responsible for biological 
fixation of N2 to assimilable ammonia. Levels of the NifH protein can be used to track 
the total potential metabolic capacity for nitrogen fixation in any sample or community. 
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This NifH peptide sequence tag is perfectly or near-perfectly conserved in NifH 
proteins from all known organisms including: alpha, gamma, beta proteobacteria, 
enterobacteria, cyanobacteria, low GC gram+ bacteria, high GC gram+ bacteria, 
euryarchaeotes. 

5. PsaA: cHFSWKMQSD VW (SEQ ID NO: 5) PsaA is a core subunit of 
Photosystem I, a key complex involved in transduction of light to chemical energy in all 
oxygenic photobionts. Photosystem I participates in both linear and cyclic electron 
transport in photoautotrophic organisms. The molar ratio between Photosystem II and 
Photosystem I varies widely between taxa and under different environmental 
conditions (Falkowski & Raven, 1997), and is an important factor for inferring the 
acclimation state and photosynthetic performance of an organism or a community. This 
PsaA peptide sequence tag is specific to the sequence of the PsaA core protein of 
Photosystem I frbm all known photoautotrophic organisms, with the exception of a 
single amino acid mismatch at the third position in the dinoflagellate Heterocapsa 
triquetra 

6. NirB: HWTGCPNSC (SEQ ID NO: 6) NirB or nitrite reductase catalyzes 
the reduction of nitrite to ammonia, which is an obligatory intermediary step in 
assimilation of inorganic nitrate into the biosphere. Nitrate is the dominant source of 
inorganic nitrogen supporting primary productivity in most ecosystems and hence 
tracking NirB levels show the metabolic capacity for assimilation of this key nitrogen 
source involved in eutrophication, agricultural run-off and stimulation of algal blooms 
including harmful (toxic) algal blooms. 
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7. RbcL185: KPKLGLSc (SEQ ID NO: 7) This peptide, sequence tag is 
conserved in both Type I and Type II RbcL and hence can be applied to raise antibodies 
that will recognize both classes of RUBISCO enzyme, including the RUBISCO found in 
dinoflagellates and the zooxanthellae symbionts of coral. 

8. RbcL185a: KPKLGLSGKN YGRc (SEQ ID NO: 8)This peptide sequence 
tag is conserved in Type I RUBISCO and could be applied to generate a second anti- 
RUBISCO antibody for use in ELIS A sandwich assays. 

9. RbcLllS: DLFEEGSc (SEQ ID NO: 9) This peptide sequence tag is 
conserved in Type I RUBISCO and could be applied to generate a second anti-RUBISCO 
antibody for use in ELISA sandwich assays. 

10. NarB: IFAEVGRRLG F (SEQ ID NO: 10) This peptide sequence tag is 
specific to the nitrate reductase (NarB) enzyme from cyanobacteria, a key enzyme in 
nitrate assimilation. 

11. NifDMo: VSQSLGHHIAND (SEQ ID NO: 11) This peptide sequence tag 
is specific to the NifD subunit of the sub-set of nitrogenases with an iron/molybdenum- 
based co-factor (as opposed to iron/ vanadium or pure iron cof actors). 

12. NifKMo: CTTCMAEVIG DDL (SEQ ID NO: 12) This peptide sequence 
tag is specific to the NifK subunit of the sub-set of nitrogenases with an 
iron/molybdenum-based co-factor (as opposed to iron/ vanadium or pure iron 
cofactors). 

13. NifKMo: CMAEVIGDDL (SEQ ID NO: 13) This peptide sequence tag is 
an alternate target specific to the NifK subunit of the sub-set of nitrogenases with an 

14 

SUBSTITUTE SHEET (RULE 26) 



I 

WO 03/016904 PCT/CA02/01249 

iron/molybdenum-based co-factor (as opposed to iron/ vanadium or pure iron 
cofactors). 

14. PsbAl: GRQWELc(SEQ ID NO: 14) This peptide sequence tag is specific 
to cyanobacterial PsbAl, a form of Psb A expressed in acclimated cyanobacteria, but not 
in eukaryotic photobionts (plants and algae). Monitoring this protein can thus track the 
contribution of acclimated cy anobacteria to Photosystem II light energy conversion in a 
mixed community. 

15. PsbA2: GREWELc(SEQ ID NO: 15) This peptide sequence tag is specific 
to cyanobacterial PsbA2, a form of PsbA expressed only in cyanobacteria experiencing 
excitation stress or UVB stress (e.g. Campbell et al., 1998). Monitoring this protein can 
thus track when a cyanobacterial population is under excitation or UVB stress. It is also 
specific to the sole constitutive form of PsbA in eukaryotic photobionts (plants and 
algae). 

16. PsaB: FPCDGPGRGG TC (SEQ ID NO: 16) This peptide sequence tag is 
specific to the PsaB core protein of Photosystem I, a key complex involved in 
transduction of light to chemical energy in all oxygenic photobionts. 
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I Claim: 

1. A method for detecting the presence of a member of a target protein family in a 
sample comprising the steps of: 

(a) identifying and obtaining a peptide sequence tag conserved for all sequences of 
members of the target protein family, and exclusive to the target protein family; 

(b) assessing ihe tag for immimogenicity potential; . 

(c) utilizing the tag to elicit the production of antibodies; and 

(d) using the antibodies to measure the concentration of a member of the target 
protein family in a sample 

2. The method of claim 1, wherein, following assessing of the tag, the tag is 
synthesized. 

3. A method of claim 1, wherein following assessing the tag, the tag is assessed for 
stability, solubility and absence of regions of post-translational modification. 

4. The method of claim 1, wherein, following synthesis, the tag is. coupled to an 
immimogenic carrier molecule, and the resulting complex is used to elicit production of 
antibodies. 

5. The method of claim 1, wherein the peptide sequence tag is selected from the 
group consisting of 

SEQ ID NO: 1 SEQ ID NO: 9 
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SEQIDNO: 2 


SEQIDNO: 10 


SBQIDNO: 3 


SEQIDNO: 11 


SEQIDNO: 4 


SEQIDNO: 12 


SEQIDNO: 5 


SEQIDNO: 13 


SEQIDNO: 6 


SEQIDNO: 14 


SEQIDNO: 7 


SEQIDNO: 15 


SEQIDNO:8 


SEQIDNO: 16 



6. A method of preparing a peptide sequence tag for use in the production of global 
antibodies comprising the steps of: 

(a) identifying and obtaining a peptide sequence conserved for all members 
of and exclusive to a protein family; 

1>) assessing the tag for immunogenicity; and 

(c) synthesizing the tag provided it possesses a predetermined level of 
immunogenicity. 

7. A peptide sequence tag selected from the group consisting of 
SEQ ID NO: 1 SEQ ID NO: 9 

SEQIDNO: 2 SEQIDNO:10 
SEQ ID NO: 3 SEQ ID NO: 11 

SEQIDNO: 4 SEQ ID NO: 12 

SEQIDNO: 5 SEQIDNO:13 
SEQIDNO: 6 SEQIDNO:14 
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SEQ ID NO: 7 SEQ ID NO: 15 

SEQIDNO: 8 SEQ ID NO: 16 

8. A method of using a peptide sequence tag for coupling to column matrice 
materials for affinity purification of the global antibodies produced according to the 
method of claim 1. 

9. A method of developing characterized concentration standards for quantitation 
of the concentration of target proteins in samples by comparison to the concentration 
standards comprising the steps of: 

(a) coupling.a defined molar quantity of protein carrier molecule to a defined molar 
quantity of a peptide sequence tag selected from the group consisting of 



SEQIDNO: 1 


SEQ ID NO: 9 


SEQIDNO: 2 


SEQIDNO: 10 


SEQIDNO: 3 


SEQIDNO: 11 


SEQIDNO: 4 


SEQ ID NO: 12 


SEQIDNO: 5 


SEQ ID NO: 13 


SEQIDNO: 6 


SEQ ID NO: 14 


SEQIDNO: 7 


SEQ ID NO: 15 


SEQIDNO: 8 


SEQ ID NO: 16 



(b) subjecting a known molar quantity of the coupled complex from step (a) to 
electrophoretic separation in parallel SDS-PAGE gel lanes with protein extracts 
containing members of the target protein family, followed by electrophoretic transfer to 
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a membrane; 

(c) immunodetection using a global antibody produced according to the method of 
claim 1 of the coupled standard and any members of the target protein family identified 
by the peptide sequence tag; and 

(d) using an immunological signal from the known molar quantity of coupled 
complex as a standard for measuring the molar quantity of members of the target 
protein family present in the protein extracts. 

10. A method for quantitation of members of target protein families in multiple 
samples using Enzyme-Linked Immunosorbent Assay kits using on characterized 
global antibodies produced according to the method of claim 1 and quantitation 
standards produced according to the method of claim 9. 

11. A method for eliciting production of a monoclonal, transgenic or synthetic 
antibody using a peptide sequence tag produced according to the method of claim 6 and 
standard immunological protocols. 

12. A method for affinity screening of libraries of reagents to detect a specific reagent 
binding to a peptide sequence tag produced according to the method of claim 6 and 
immobilized on a support matrix; and 

testing reagents binding to the immobilized -peptide sequence tag for affinity 
binding to members of target protein families. 

20 



SUBSTITUTE SHEET (RULE 26) 



I f 



WO 03/016904 PCT/CA02/01249 

13. A method to use affinity binding using global antibodies produced according to 
the method of claim 1 to capture target proteins from complex mixturesi or subsequent 
analyses of the specific sequences of target proteins present in the mixture using mass 
spectroscopy. 

14. A method to use affinity binding using global antibodies produced according to 
the method of claim 1 to capture and remove target proteins from complex mixtures to 
lower interference with detection and analyses of other less abundant protein classes in 
proteomics applications such as two dimensional isoelectric focussing /SDS-PAGE and 
subsequent mass spectroscopic analyses of protein sequences. 
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Synechococcus 
Spartina (marsh grass) 
mixed phytoplankton 




Figure 1 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION . 

(i) APPLICANT : Campbell, Douglas A. 

(ii) TITLE. OF INVENTION: PEPTIDE SEQUENCE TAG 

(iii) NUMBER OF SEQUENCES: 16 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE : Stikeman, Elliott 

(B) STREET: 1600 - 50 O 1 Connor Street 

(C) CITY: Ottawa 

(D) PROVINCE: Ontario 

(E) COUNTRY: Canada 

(?) POSTAL CODE: KIP' 6L2 

(v) . COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATION SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: ASCII 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) ATTORNEY/ AGENT INFORMATION 

(A) NAME: Derenyi , Eugene 

(B) REGISTRATION NO. : P-52,409 

(C) REFERENCE/DOCKET NUMBER: 105297-1014 

<ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE : (613) 566-0544 

(B) TELEFAX: (613) 230-8877 

(2) INFORMATION FOR SEQ. ID NO si: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 
EVMHERNAHN FPLDC 



(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2 
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CLRGGLDFTK DDENINS 



(2) INFORMATION FOR SEQ ID NO: 3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 
CTNSYKRLVP G 



(2) INFORMATION FOR SEQ ID NO : 4 : 
{ i ) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) YPE: peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
VESGGPEPGV GC 



(2) INFORMATION FOR SEQ ID NO: 5: 
(i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 12 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
CHFSWKMQSD VW 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
HWTGCPNSC 



(2) INFORMATION FOR SEQ ID NO: 7: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
KPKLGLSC 



(2) INFORMATION FOR SEQ ID NO: 8: 
( i ) SEQUENCE CHARACTERI STI CS : 

(A) LENGTH: 14 base pairs 

(B) TYPE: peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
KPKLGLSGKN YGRC 



(2) INFORMATION FOR SEQ ID NO : 9 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
DLFEEGSC 



(2) INFORMATION FOR SEQ ID NO: 10: 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 11 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
IFAEVGRRLG F 



(2) INFORMATION FOR SEQ ID NO: 11: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

( B ) TYPE : peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
VSQSLGHHIA ND 



(2) INFORMATION FOR SEQ ID NO: 12: 
( i ) SEQUENCE CHARACTERISTICS : 
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(A) LENGTH: 13 base pairs 

(B) TYPE: peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 
CTTCMAEVIG DDL 



(2) INFORMATION FOR SEQ ID NO: 13 : 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
CMAEVIGDDL 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
GRQWELC 



(2) INFORMATION FOR SEQ ID NO : 15 : 
( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 7 base pairs 

(B) TYPE : peptide 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GREWELC 



(2) INFORMATION FOR SEQ ID NO: 16: 
( i ) SEQUENCE CHARACTERI STICS : 

(A) LENGTH: 12 base pairs 

( B ) TYPE : peptide 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
FPCDGPGRGG TC 



SUBSTITUTE SHEET (RULE 26) 



